A Parameterized and Annotated Spoken Dialog Corpus of the CMU Let's Go Bus Information System

نویسندگان

  • Alexander Schmitt
  • Stefan Ultes
  • Wolfgang Minker
چکیده

Standardized corpora are the foundation for spoken language research. In this work, we introduce an annotated and standardized corpus in the Spoken Dialog Systems (SDS) domain. Data from the Let’s Go Bus Information System from the Carnegie Mellon University in Pittsburgh has been formatted, parameterized and annotated with quality, emotion, and task success labels containing 347 dialogs with 9,083 system-user exchanges. A total of 46 parameters have been derived automatically and semi-automatically from Automatic Speech Recognition (ASR), Spoken Language Understanding (SLU) and Dialog Manager (DM) properties. To each spoken user utterance an emotion label from the set garbage, non-angry, slightly angry, very angry has been assigned. In addition, a manual annotation of Interaction Quality (IQ) on the exchange level has been performed with three raters achieving a κ value of 0.54. The IQ score expresses the quality of the interaction up to each system-user exchange on a score from 1-5. The presented corpus is intended as a standardized basis for classification and evaluation tasks regarding task success prediction, dialog quality estimation or emotion recognition to foster comparability between different approaches on these fields.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Demonstration of AT&T "Let's Go": A production-grade statistical spoken dialog system

This is a demonstration of the AT&T “Let’s Go” bus timetable spoken dialog system. This system was entered in the 2010 Spoken Dialog Challenge [1], where the task is to provide bus timetable information for Pittsburgh, Pennsylvania. Our primary aim in the challenge was to build a statistical spoken dialog system to comemrcial production standards, both in terms of user interface, and also in te...

متن کامل

Let's go lab: a platform for evaluation of spoken dialog systems with real world users

This short paper is intended to advertise Let’s Go Lab, a platform for the evaluation of spoken dialog research. Unlike other dialog platforms, in addition to example dialog data and a portable software system, Let’s Go Lab affords evaluation with real users. Let’s Go has served the Pittsburgh public with bus schedule information since 2005, answering more than 52,000 calls to date.

متن کامل

Dynamic language modeling using Bayesian networks for spoken dialog systems

We introduce a new framework employing statistical language models (SLMs) for spoken dialog systems that facilitates the dynamic update of word probabilities based on dialog history. In combination with traditional state-dependent SLMs, we use a Bayesian Network to capture dependencies between user goal concepts and compute accurate distributions over words that express these concepts. This all...

متن کامل

Let's go public! taking a spoken dialog system to the real world

In this paper, we describe how a research spoken dialog system was made available to the general public. The Let’s Go Public spoken dialog system provides bus schedule information to the Pittsburgh population during off-peak times. This paper describes the changes necessary to make the system usable for the general public and presents analysis of the calls and strategies we have used to ensure ...

متن کامل

Building Practical Spoken Dialog Systems

This tutorial will give a practical description of the free software Carnegie Mellon Olympus 2 Spoken Dialog Architecture. Building real working dialog systems that are robust enough for the general public to use is difficult. Most frequently, the functionality of the conversations is severely limited down to simple question-answer pairs. While offthe-shelf toolkits help the development of such...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012